Data reduction via adaptive sampling
نویسندگان
چکیده
منابع مشابه
Data Reduction via Adaptive Sampling ∗
Data reduction is an important issue in the field of data mining. This article describes a new method for selecting a subset of data from a large dataset. A simplified chi-square criterion is proposed for measuring the goodness-of-fit between the distributions of the reduced and full data sets. Under this criterion, the data reduction problem can be formulated as a binary quadratic program and ...
متن کاملColumn Selection via Adaptive Sampling
Selecting a good column (or row) subset of massive data matrices has found many applications in data analysis and machine learning. We propose a new adaptive sampling algorithm that can be used to improve any relative-error column selection algorithm. Our algorithm delivers a tighter theoretical bound on the approximation error which we also demonstrate empirically using two well known relative...
متن کاملadaptive non‑uniform rate sampling and application in data compression
in this paper the author considers a general method, based on time domain samples for spectral manipulation of time limited signals. first, the original signal is divided into some frames in the time domain. then, by presenting a suitable theoretical and computational algorithm, and using a method for improving the speed of convergence, we find the local bandwidth of each frame; thereby, each f...
متن کاملApplication of adaptive sampling in fishery part 1: Adaptive cluster sampling and its strip designs
Abstract: The precision of conventional sampling designs is not usually satisfactory for estimating parameters of clump and rare populations. Many of fish species live in school and disperse all over a vast area like a sea so that they are rare compare to their habitats. Theory of a class of sampling designs called adaptive sampling designs has rapidly grown during last decade which solv...
متن کاملApplication of adaptive sampling in fishery part 2: Truncated adaptive cluster sampling designs
There are some experiences that researcher come across quite number of time for very large networks in the initial samples such that they cannot finish the sampling procedure. Two solutions have been proposed and used by marine biologists which we discuss in this article: i) Adaptive cluster sampling based on order statistics with a stopping rule, ii) Restricted adaptive cluster sampling. Until...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Communications in Information and Systems
سال: 2002
ISSN: 1526-7555,2163-4548
DOI: 10.4310/cis.2002.v2.n1.a3